COVID-19 RELATED DATA ANALYSIS

PROJECT OVERVIEW

Today the entire world shares the same goals: determine how the COVID-19 pandemic started, promote coronavirus vaccination, and analyze the factors that can drive COVID-19 cases to zero. Ambitious as these goals are, they can be approached through regular analysis of COVID-19 statistics, including new cases, fatalities, testing, etc. The goal of this project is to analyze real-life COVID-19 data and identify the factors that matter for virus spread, hospitalization and fatality rates.

This is the final data used in the analysis

country year_week country_code tests_done population testing_rate positivity_rate dateRep cases_weekly deaths_weekly
0 Belgium 2020-11 BE 9924 11455519 86.630732 17.482870 16/03/2020 1735 31
1 Belgium 2020-12 BE 17066 11455519 148.976227 27.036212 23/03/2020 4614 265
2 Belgium 2020-13 BE 27141 11455519 236.925101 31.465311 30/03/2020 8540 874
3 Belgium 2020-14 BE 38426 11455519 335.436570 25.737782 06/04/2020 9890 1671
4 Belgium 2020-15 BE 53517 11455519 467.172199 18.270830 13/04/2020 9778 1999
... ... ... ... ... ... ... ... ... ... ...
860 Sweden 2020-49 SE 261230 10230185 2553.521759 14.218122 07/12/2020 37142 446
861 Sweden 2020-50 SE 270944 10230185 2648.476054 16.016225 14/12/2020 43395 526
862 Sweden 2020-51 SE 299447 10230185 2927.092716 15.971107 21/12/2020 47825 379
863 Sweden 2020-52 SE 232114 10230185 2268.913026 16.840432 28/12/2020 39089 237
864 Sweden 2020-53 SE 232114 10230185 2268.913026 17.762823 04/01/2021 41230 103

649 rows × 10 columns
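The two rate columns in the table are consistent with simple ratios of the raw columns: tests per 100 000 inhabitants, and weekly cases as a percentage of tests done. The sketch below reproduces the first two Belgium rows under that assumption. The source data may already supply these columns, so the helper `add_rates` is hypothetical and not necessarily how the table was built.

```python
import pandas as pd


def add_rates(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with testing_rate and positivity_rate added.

    Assumed definitions (inferred from the values in the table):
      testing_rate    = tests_done / population * 100 000
      positivity_rate = cases_weekly / tests_done * 100
    """
    out = df.copy()
    # Tests performed per 100 000 inhabitants.
    out["testing_rate"] = out["tests_done"] / out["population"] * 100_000
    # Weekly cases as a percentage of tests done.
    out["positivity_rate"] = out["cases_weekly"] / out["tests_done"] * 100
    return out


# First two Belgium rows from the table above.
sample = pd.DataFrame(
    {
        "country": ["Belgium", "Belgium"],
        "year_week": ["2020-11", "2020-12"],
        "tests_done": [9924, 17066],
        "population": [11455519, 11455519],
        "cases_weekly": [1735, 4614],
    }
)
rates = add_rates(sample)
print(rates[["year_week", "testing_rate", "positivity_rate"]])
```

Normalizing by population is what makes countries of very different size comparable in the sections that follow.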

What is the best way to compare testing?

[Figure: Testing rate per 100 000 population. x-axis: Week of the year; y-axis: Testing rate]

The plot shows that during the first few weeks Malta had the highest testing rate of all countries, probably because it has the smallest population. For most of the observed period, however, Denmark led in testing rate. Its rate rose markedly and peaked at more than 14000 tests per 100 000 population in the 51st week of 2020.
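Peaks like the one read off the plot can also be looked up directly in the data. The following is a minimal sketch, assuming the long-format columns `country`, `year_week` and `testing_rate` from the table above. The helper name `peak_testing_week` is hypothetical, and the sample contains only a few rows copied from the table excerpt, so its peaks are not the full-year ones.

```python
import pandas as pd


def peak_testing_week(df: pd.DataFrame) -> pd.DataFrame:
    """For each country, return the row with the highest testing rate."""
    # idxmax gives, per country, the index label of the maximum value.
    idx = df.groupby("country")["testing_rate"].idxmax()
    cols = ["country", "year_week", "testing_rate"]
    return df.loc[idx, cols].reset_index(drop=True)


# A few rows from the table excerpt (not the complete data set).
sample = pd.DataFrame(
    {
        "country": ["Belgium"] * 3 + ["Sweden"] * 3,
        "year_week": ["2020-11", "2020-12", "2020-13",
                      "2020-50", "2020-51", "2020-52"],
        "testing_rate": [86.630732, 148.976227, 236.925101,
                         2648.476054, 2927.092716, 2268.913026],
    }
)

peaks = peak_testing_week(sample)
print(peaks)

# Wide layout: one row per week, one column per country.
wide = sample.pivot(index="year_week", columns="country", values="testing_rate")
```

With matplotlib installed, calling `wide.plot()` on the wide table would draw one line per country, which is the usual layout for a chart of this kind.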

Is there a correlation between testing rate and positivity rate?

The correlation between testing_rate and positivity_rate by country is as follows:

*************************************************************************

strong positive correlation 0.8423 for Czechia
strong positive correlation 0.6266 for Estonia
strong positive correlation 0.8957 for Latvia
strong positive correlation 0.6383 for Malta
strong positive correlation 0.9171 for Slovenia
******************************************************

negligible correlation 0.1920 for Belgium
negligible correlation -0.1945 for Denmark
negligible correlation 0.0827 for France
negligible correlation -0.0026 for Ireland
negligible correlation 0.0951 for Sweden
******************************************************

weak to moderate positive correlation 0.4348 for Germany
weak to moderate positive correlation 0.5511 for Greece
weak to moderate positive correlation 0.2156 for Italy
weak to moderate positive correlation 0.2447 for Netherlands
weak to moderate positive correlation 0.3638 for Portugal
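A per-country grouping like the one listed above can be sketched with a pandas group-by and the Pearson coefficient. The cut-offs (0.6 and 0.2), the label wording and the function names are assumptions for illustration, not the code that produced the listing. The sample values are made up so that the result is easy to check by eye.

```python
import math

import pandas as pd

STRONG = 0.6  # assumed cut-off for a "strong" correlation
WEAK = 0.2    # assumed cut-off below which the correlation is negligible


def label(r: float) -> str:
    """Map a Pearson coefficient to a coarse verbal label."""
    if math.isnan(r):
        return "undefined"
    if r >= STRONG:
        return "strong positive"
    if r <= -STRONG:
        return "strong negative"
    if abs(r) < WEAK:
        return "negligible"
    return "weak to moderate"


def correlation_by_country(df: pd.DataFrame, x: str, y: str) -> pd.DataFrame:
    """Pearson correlation between columns x and y, one row per country."""
    rows = []
    for country, group in df.groupby("country"):
        r = group[x].corr(group[y])  # Series.corr uses Pearson by default
        rows.append({"country": country, "r": r, "label": label(r)})
    return pd.DataFrame(rows)


# Made-up values: a perfectly increasing and a perfectly decreasing series.
sample = pd.DataFrame(
    {
        "country": ["Country A"] * 4 + ["Country B"] * 4,
        "testing_rate": [1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0, 4.0],
        "positivity_rate": [2.0, 4.0, 6.0, 8.0, 8.0, 6.0, 4.0, 2.0],
    }
)
result = correlation_by_country(sample, "testing_rate", "positivity_rate")
print(result)
```

The same helper applies to any pair of numeric columns, for example `cases_weekly` and `deaths_weekly`.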

What is the association between new cases and deaths?

The correlation between cases_weekly and deaths_weekly by country is as follows:

*************************************************************************

strong positive correlation 0.8750 for Czechia
strong positive correlation 0.6234 for Denmark
strong positive correlation 0.8862 for Estonia
strong positive correlation 0.7983 for Germany
strong positive correlation 0.7400 for Greece
strong positive correlation 0.6029 for Italy
strong positive correlation 0.9642 for Latvia
strong positive correlation 0.7722 for Malta
strong positive correlation 0.8858 for Portugal
strong positive correlation 0.6459 for Slovenia
******************************************************

weak positive correlation 0.2827 for Belgium
weak positive correlation 0.3194 for France
weak positive correlation 0.2299 for Ireland
weak positive correlation 0.3105 for Netherlands
weak positive correlation 0.3313 for Sweden

Which country has the highest number of deaths?

To compare countries by their geographical position, we can use an interactive map.

[Images: 2.PNG, Capture.PNG]

How does the number of deaths vary across countries?

As each country has a different population, we estimated the fatality rate: the number of deaths relative to population. Italy still holds the leading position by both median and maximum fatality rate. High median fatality rates are also observed for France, Belgium, Greece and Latvia. The lowest median fatality rate belongs to Estonia; Denmark, Malta and Portugal have the lowest variability of fatality.
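A per-country summary of this kind can be sketched as below. The scaling to deaths per 100 000 population, the use of the interquartile range as the measure of variability, and the helper name `fatality_summary` are assumptions for illustration. The sample holds only the ten Belgium and Sweden rows visible in the table excerpt, so the resulting numbers are not the full-year figures discussed above.

```python
import pandas as pd


def fatality_summary(df: pd.DataFrame) -> pd.DataFrame:
    """Median, maximum and IQR of the weekly fatality rate per country."""
    out = df.copy()
    # Assumed definition: weekly deaths per 100 000 inhabitants.
    out["fatality_rate"] = out["deaths_weekly"] / out["population"] * 100_000
    grouped = out.groupby("country")["fatality_rate"]
    summary = pd.DataFrame(
        {
            "median": grouped.median(),
            "max": grouped.max(),
            # Interquartile range as a simple measure of variability.
            "iqr": grouped.quantile(0.75) - grouped.quantile(0.25),
        }
    )
    return summary.sort_values("median", ascending=False)


# Rows taken from the table excerpt only (five weeks per country).
sample = pd.DataFrame(
    {
        "country": ["Belgium"] * 5 + ["Sweden"] * 5,
        "population": [11455519] * 5 + [10230185] * 5,
        "deaths_weekly": [31, 265, 874, 1671, 1999, 446, 526, 379, 237, 103],
    }
)
summary = fatality_summary(sample)
print(summary.round(3))
```

The median and interquartile range correspond to the centre line and box height of a box plot, which is a common way to show this kind of comparison.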

CONCLUSIONS

This project describes an end-to-end analysis of COVID-19 related data, including data loading, filtering, reshaping, transformation and visualization. It covers weekly 2020 statistics for 15 countries: Greece, Malta, Italy, Portugal, Latvia, Denmark, Czechia, Sweden, Belgium, France, Netherlands, Slovenia, Estonia, Germany and Ireland.
The study found that Denmark had the highest testing rate for most of 2020. The association between testing rate and positivity rate differs by country: it is strongly positive in some (for example Slovenia, 0.9171), close to zero in others (for example Ireland, -0.0026), and weak to moderate in the rest (for example Italy, 0.2156). No country shows a strong negative correlation. At the same time, weekly deaths are strongly and positively associated with weekly new cases in 10 of the 15 countries. Italy had the highest number of deaths in 2020, with more than 75k deaths recorded. It also has the highest fatality rate among the 15 countries used in the analysis.

FUTURE WORK

All further steps in this project are aimed at analyzing 2021 data, including vaccination factors. As different virus strains have been identified over the last month, it is important to include this information in further investigation of COVID-related data.

REFERENCES

  1. How ECDC collects and processes COVID-19 data. https://www.ecdc.europa.eu/en/covid-19/data-collection

  2. Sources - Worldwide data on COVID-19. https://www.ecdc.europa.eu/en/publications-data/sources-worldwide-data-covid-19

  3. Data on hospital and ICU admission rates and current occupancy for COVID-19. https://www.ecdc.europa.eu/en/publications-data/download-data-hospital-and-icu-admission-rates-and-current-occupancy-covid-19

  4. Data on testing for COVID-19 by week and country. https://www.ecdc.europa.eu/en/publications-data/covid-19-testing

  5. Data on the weekly subnational 14-day notification rate of new COVID-19 cases. https://www.ecdc.europa.eu/en/publications-data/weekly-subnational-14-day-notification-rate-covid-19